Performance and Fault Tolerance of Preconditioned Iterative Solvers on Low-Power ARM Architectures

نویسندگان

  • José Ignacio Aliaga
  • Sandra Catalán
  • Charalampos Chalios
  • Dimitrios S. Nikolopoulos
  • Enrique S. Quintana-Ortí
چکیده

As the complexity of computing systems grows, reliability and energy are two crucial challenges that will demand holistic solutions. In this paper, we investigate the interplay among concurrency, power dissipation, energy consumption and voltage-frequency scaling for a key numerical kernel for the solution of sparse linear systems. Concretely, we leverage a task-parallel implementation of the Conjugate Gradient method, equipped with an state-of-the-art preconditioner embedded in the ILUPACK software, and target a low-power multicore processor from ARM. In addition, we perform a theoretical analysis on the impact of a technique like Near Threshold Voltage Computing (NTVC) from the points of view of increased hardware concurrency and error rate.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reliability and Performance Evaluation of Fault-aware Routing Methods for Network-on-Chip Architectures (RESEARCH NOTE)

Nowadays, faults and failures are increasing especially in complex systems such as Network-on-Chip (NoC) based Systems-on-a-Chip due to the increasing susceptibility and decreasing feature sizes. On the other hand, fault-tolerant routing algorithms have an evident effect on tolerating permanent faults and improving the reliability of a Network-on-Chip based system. This paper presents reliabili...

متن کامل

Parallel Iterative Solvers for Ill-Conditioned Problems with Reordering

1. Preconditioned Iterative Solvers with Multicoloring In the previous work [1], author developed an efficient parallel iterative solver for finite-element applications on the Earth Simulator (ES) [2] using multi-level hybrid parallel programming model with MPI and OpenMP. The method employs three-level hybrid parallel programming model for SMP cluster architectures, consisting of MPI, OpenMP a...

متن کامل

A backward/forward recovery approach for the preconditioned conjugate gradient method

Several recent papers have introduced a periodic verification mechanism to detect silent errors in iterative solvers. Chen [PPoPP’13, pp. 167–176] has shown how to combine such a verification mechanism (a stability test checking the orthogonality of two vectors and recomputing the residual) with checkpointing: the idea is to verify every d iterations, and to checkpoint every c × d iterations. W...

متن کامل

CAFT: Cost-aware and Fault-tolerant routing algorithm in 2D mesh Network-on-Chip

By increasing, the complexity of chips and the need to integrating more components into a chip has made network –on- chip known as an important infrastructure for network communications on the system, and is a good alternative to traditional ways and using the bus. By increasing the density of chips, the possibility of failure in the chip network increases and providing correction and fault tol...

متن کامل

Enhancing Parallelism in Monte Carlo Techniques for Solving Large Sparse Linear Systems

The problem of solving large scale sparse linear systems arises in many scientific and engineering applications. Recent advances in multicore processors and clusters that consist of hundreds of thousands of cores motivate new techniques to solve such problems efficiently. Two main design considerations are the accuracy of the solution and the scalability of the method, which compete against eac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015